CDS

Accession Number TCMCG033C20122
gbkey CDS
Protein Id TQD94286.1
Location complement(join(501624..502220,502871..503303,503394..504262,504403..505197,505914..506131,506477..506840))
Organism Malus baccata
locus_tag C1H46_020100

Protein

Length 1091aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA428857, BioSample:SAMN08323692
db_source VIEB01000347.1
Definition hypothetical protein C1H46_020100 [Malus baccata]
Locus_tag C1H46_020100

EGGNOG-MAPPER Annotation

COG_category O
Description Ubiquitin-activating enzyme E1
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
ko04147        [VIEW IN KEGG]
KEGG_ko ko:K03178        [VIEW IN KEGG]
EC 6.2.1.45        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04120        [VIEW IN KEGG]
ko05012        [VIEW IN KEGG]
map04120        [VIEW IN KEGG]
map05012        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGATTTTGGTGTTTCGGCAGTTCATCGCACTATATGCTTCCTAGAAAGAGAGAAGCAGTGGGCGGAGAGGTTGTAGTGAACGAAGCCACGGAATCTCCGATCAAGAAGCCTCGCGGAGCTGCGACTACTGACGATTCTAAGAGCAACGAAAACAAAAACACCACCACCTTTAACAACAACAGCAACAGCAGTAGCAGCATCGAAGAATCTCCAATCATGGCATTGGGCAATGGGAGCTCAAACGATATCGATGAAGATCTCCACAGCCGGCAGTTGGCGGTTTATGGCAGGGAGACCATGCGCCGGCTCTTTGCCTCCAACATTCTCGTCTCGGGGATGCAGGGCTTGGGCGCTGAGATTGCAAAAAACCTTGTTCTTGCCGGTGTCAAGTCTGTGACTTTGCATGATGATGGAGTTGTTGAGTTGTGGGACTTGTCAAGCAACTTCTTCTTTTCTGAGGAGGATGTTGGAAAGAATCGGGCCCTTGCCTGTGTTCAAAAATTACAAGAACTAAACAACGCTGTTGTCATTTCGACCATAACAACTGAACTGACTAAAGAGAAACTCTCTGATTTCCAGGCTGTAGTTTTCACTGATATTAGCTTGGAGAAAGCAATTGAATTCGATGACTTCTGCCATAATCATAACCCTCCCATCTCTTTTATAAAATCAGAAGTACGAGGCCTTTTTGGCAGCGTATTTTGTGACTTTGGCCCTGAGTTCACTGTTCTTGATGTTGATGGTGAGGATCCACATACAGGTATAATTGCATCGATTAGTAATGACAACCCTGCTCTCATAGCATGTGTTGATGATGAGAGGCTTGAGTTCCAGGATGGGGATTTAGTTGTGTTCACCGAAGTCCATGGAATGACTGAGTTGAATGATGGAAAGCCGAGGAAGGTTAAGAATGCAAGACCTTACTCATTCACAATCGAGGAGGACACTACCAATTATGCTGCATATGAGAAAGGGGGTATAGTCTCACAGGTGAAGCAACCCAAGGTGTTGAACTTTAAGCCTCTGCGAGAAGCACTCAAGGATCATGGTGACTTCCTTCTGAGTGACTTCTCCAAGTTTGATCGCCCACCTCTCCTACACTTGGCATTTCAGGCACTAGATAAGTTCATTTCAGAGTTGGGGCGTTTCCCTGTTGCTGGATCTGAGGATGATGCTACAAAATTCATATCTATGGTAACTAACATCAATGATAGCTCAGCAGATGGTAAGCTTGAGGAGATTGACCACAAGGTTCTTCATCATTTTGCATTTGGTGCTAGAGCTGTGCTAAATCCCATGGCTGCTATGTTTGGTGGTATTGTTGGACAAGAAGTTGTGAAGGCCTGTTCTGCTAAGTTCCATCCTCTTTTCCAGTTTTTCTACTTCGATTCAGTTGAATCCCTTCCTTCAGAGACCTTGGATCCTAATGATTTGAAGCCTTTGAATAGCCGATATGATGCACAAATTTCAGTATTTGGAGCCAAGCTCCAGAAAAAGCTGGAGGACGCAAAAGTGTTCACTGTCGGATCTGGGGCACTAGGATGTGAGTTTTTGAAGAATTTAGCATTGATGGGTGTCTCTTGTGGTAAAGAAGGGAAGTTAACAATAACAGATGATGATGTCATTGAGAAGAGTAACCTTAGCAGGCAATTTCTCTTTAGAGATTGGAACATTGGGCAGGCTAAATCAACAGTAGCTGCCTCTGCTGCTATGTTGATAAATGGTCGTCTGAACATTGAAGCACTGCAGAATCGTGCAAGCCCAGATACTGAAAATGTGTTTGATGATACTTTTTGGGAGAATTTGGATGTTGTTATCAATGCTCTGGATAATGTGAATGCAAGGCTTTACATTGATCAGAGATGCTTGTACTTCCAGAAGCCACTTTTGGAATCAGGAACTCTAGGTGCCAAGTGCAACACGCAGATGGTCATTCCTCACCTGACTGAAAATTATGGAGCATCACGGGACCCACCTGAAAAGCAAGCACCCATGTGCACAGTTCATTCATTCCCTCACAACATTGACCATTGCTTGACATGGGCCCGGTCTGAGTTTGAGGGTCTACTTGAGAAGGTGCCAGCTGAAGTAAATGCCTACCTGACTAATCCTAATGAATACATTGCTGCCATGAAAAATGCTGGTGATGCTCAGGCCAGGAACAACTTGGAAAGTGTCATTGAATGCCTTGACAAAGAGAGATGTGAGACGTTCCAAGATTGCATAAGCTGGGCCCGTCTGAAGTTTGAGGACTACTTTGCTAACCGTGTGAAGCAGTTAACTTATACTTTTCCTGAGGATGCTACAACCAGTAGCGGAACACCATTTTGGTCTGCCCCCAAGCGTTTTCCTCGCCCCCTGCAGTTCTCAGTTGATGATCTCAGCCACCTCCAGTTTTTAATGGCAGCATCTATACTACGAGCAGAGACATTCAACATTCCAATTCCTGATTGGGTCAAGTCTCGTGCAAAGTTTGCTGATGCTGTTAACAAAGTGATGGTACCAGATTTTCAGCCTAAGAAAGATGTGAAGATTGAGACTGATGAGAAAGCTACAACCGTCTTACCAGCATCGATTGATGATGCTGCGGTTATCAACGAGTTAGTTGTGAAGTTAGAAAGATGCAAGGAGCGGCTGCCACCAGGCTTCAAGATGAACCCGATTCAGTTTGAGAAGGATGATGATACGAACTATCATATGGACCTAATAGCTGGATTTGCAAATATGAGGGCAAGGAACTATGGCATTGGTGAAGTCGACAAACTGAAAGCCAAATTCATTGCAGGAAGAATCATTCCTGCAATTGCAACATCAACTGCTCTGGCAACCGGCCTTGTCTGCTTGGAGCTGTACAAGGTTCTGGACGGAGGGCACAAGGTTGAAGACTACCGAAACACCTTTGCCAATCTCTCCCTTCCTCTGTTCTCCATGGCTGAACCAGTCCCTCCTAAAGTCATCAAGCACCAAGATATGAAGTGGACAGTATGGGACAGGTGGATTATAAAGGACAACCCAACTTTAAAGCAGCTCCTCAAGTGGCTCGAAGACCAGGGCTTAAACGCGTACAGCATCTCCTATGGAAGTTGCCTGCTCTTTAACAGCATGTTCCCGAAGCACAAGGAGCGCATGGACAGGACCATGGTGGATTTGGCTACGAGTATTGCGAAGGCAGAACTGCCTGCTAATAGGAAGCACTTTGACGTGGTGGTGGCTTGCGAAGATGAAGAAGAGAACGATATTGATATCCCGCAAATCTCCATTTACTTCAAGTAG
Protein:  
MRFWCFGSSSHYMLPRKREAVGGEVVVNEATESPIKKPRGAATTDDSKSNENKNTTTFNNNSNSSSSIEESPIMALGNGSSNDIDEDLHSRQLAVYGRETMRRLFASNILVSGMQGLGAEIAKNLVLAGVKSVTLHDDGVVELWDLSSNFFFSEEDVGKNRALACVQKLQELNNAVVISTITTELTKEKLSDFQAVVFTDISLEKAIEFDDFCHNHNPPISFIKSEVRGLFGSVFCDFGPEFTVLDVDGEDPHTGIIASISNDNPALIACVDDERLEFQDGDLVVFTEVHGMTELNDGKPRKVKNARPYSFTIEEDTTNYAAYEKGGIVSQVKQPKVLNFKPLREALKDHGDFLLSDFSKFDRPPLLHLAFQALDKFISELGRFPVAGSEDDATKFISMVTNINDSSADGKLEEIDHKVLHHFAFGARAVLNPMAAMFGGIVGQEVVKACSAKFHPLFQFFYFDSVESLPSETLDPNDLKPLNSRYDAQISVFGAKLQKKLEDAKVFTVGSGALGCEFLKNLALMGVSCGKEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAMLINGRLNIEALQNRASPDTENVFDDTFWENLDVVINALDNVNARLYIDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKVPAEVNAYLTNPNEYIAAMKNAGDAQARNNLESVIECLDKERCETFQDCISWARLKFEDYFANRVKQLTYTFPEDATTSSGTPFWSAPKRFPRPLQFSVDDLSHLQFLMAASILRAETFNIPIPDWVKSRAKFADAVNKVMVPDFQPKKDVKIETDEKATTVLPASIDDAAVINELVVKLERCKERLPPGFKMNPIQFEKDDDTNYHMDLIAGFANMRARNYGIGEVDKLKAKFIAGRIIPAIATSTALATGLVCLELYKVLDGGHKVEDYRNTFANLSLPLFSMAEPVPPKVIKHQDMKWTVWDRWIIKDNPTLKQLLKWLEDQGLNAYSISYGSCLLFNSMFPKHKERMDRTMVDLATSIAKAELPANRKHFDVVVACEDEEENDIDIPQISIYFK